Figure 1A 

M 

GGCACGAGGGGGCGGCGGCTGCGGGCGCAGAGCGGAG ATG 

LLLAAAVPT AP 
CTG CTG CTG GCG GCG GCG GTC CCC ACG GCC CCC 

PVKPGPALSYP 
CCA GTC AAG CCC GGC CCG GCT CTC AGC TAC CCG 

FREVEELMEDT 
TTC CGC GAG GTT GAG GAA CTG ATG GAG GAC ACG 

EMEAEEAAAKA 
GAG ATG GAG GCA GAA GAA GCT GCT GCT AAA GCA 

PPSYHNETNTD 
CCT CCC AGC TAT CAC AAT GAG ACC AAC ACA GAC 

VHREIHKITNN 
GTG CAC CGA GAA ATT CAC AAG ATA ACC AAC AAC 

TVITSVGDEEG 
ACA GTT ATC ACA TCT GTG GGA GAC GAA GAA GGC 

EDCGPSMYCQF 
GAG GAC TGT GGG CCC AGC ATG TAC TGC CAG TTT 

CRGQRMLCTRD 
TGC CGG GGC CAG AGG ATG CTC TGC ACC CGG GAC 

VWGHCTKM'ATR 
GTC TGG GGT CAC TGC ACC AAA ATG GCC ACC AGG 

QRDCQPGLCCA 
CAG AGG GAC TGC CAG CCG GGG CTG TGC TGT GCC 

CTPLPVEGELC 
TGC ACA CCC CTG CCC GTG GAG GGC GAG CTT TGC 

LITWELEPDGA 
CTC ATC ACC TGG GAG CTA GAG CCT GAT GGA GCC 

LLCQPHSHSLV 
CTC CTC TGC CAG CCC CAC AGC CAC AGC CTG GTG 

SRDQDGEILLP 
AGC CGT GAC CAA GAT GGG GAG ATC CTG CTG. CCC 

G SFMEEVRQEL 
GGC AGC TTC ATG GAG GAG GTG CGC CAG GAG CTG 

EMALGE PAAAA 
GAG ATG GCG CTG GGG GAG CCT GCG GCT GCC GCC 



QRLGATLLC 10 

CAG CGG CTT GGG GCC ACC CTG CTG TGC 67 

APAPTATSA 30 

GCG CCC GCT CCG ACG GCG ACC TCG GCT 12 7 

QEEATLNEM 50 

CAG GAG GAG GCC ACC CTC AAT GAG ATG 187 

QHKLRSAVE 70 

CAG CAC AAA TTG CGC AGC GCG GTG GAA 24 7 

SSEVNLANL 90 

TCA TCA GAA GTG AAC CTG GCA AAC TTA 3 07 

TKVGNNTIH 110 

ACG AAG GTT GGA AAT AAT ACC ATC CAT 3 67 

QTGQMVFSE 130 

CAG ACT GGA CAA ATG GTC TTT TCA GAG 42 7 

RRSHECI ID 150 

AGA AGG AGC CAC GAG TGC ATC ATC GAC 4 87 

ASFQYTCQP 170 

GCC AGC TTC CAG TAC ACC TGC CAG CCA 54 7 

SECCGDQLC 190 

AGT GAG TGC TGT GGA GAC CAG CTG TGT 607 

GSNGTICDN 210 

GGC AGC AAT GGG ACC ATC TGT GAC AAC 667 

FQRGLLFPV 230 

TTC CAG AGA GGC CTG CTG TTC CCT GTG 72 7 

HDPASRLLD 250 

CAT GAC CCC GCC AGC CGG CTT CTG GAC 7 87 

LDRCPCASG 270 

TTG GAC CGA TGC CCT TGT GCC AGT GGC 84 7 

YVCKPTFVG 290 

TAT GTG TGC AAG CCG ACC TTC GTG GGG 907 

REV PDEYEV 310 

AGA GAG GTC CCC GAT GAG TAT GAA GTT 967 

EDLERSLTE 330 

GAG GAC CTG GAG AGG AGC CTG ACT GAA 102 7 

A ALLGGEE I 350 

GCT GCA CTG CTG GGA GGG GAA GAG ATT 1087 
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Figure IB 

AT CTGGAC CAGG CTG TGGGTAGATGTG CAATAGAAATAG CT AATTTATT T C C C CANG TGTGTG CTTTAAG CG TGGG CTG 116 9 

ACCAGGCTTCTTCCTACATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAG 124 8 

CTCCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCAGGAGCAGTTTGC 13 2 7 

CACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCGTTTGTTCTACATGGCTTTGATAATTG 14 06 

TTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTCTGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTT 14 8 5 

TGCAAACATCAACCTGGCAAAAATGCAACAAATGAATTTTCGACGCAGTTCTT^ 1564 

TCAGCTGTTGCAGATGAAATGTTCTGTTCACCCTGC^ 164 3 

CTCTGTGCCAGGGC^GCATTTTCATATCCAAGATCTVATTCCCTCTCTCAGCACAGCCTGGGGAGGGGGTC^TTGTTCT^ 172 2 

CT CGT C CAT CAGGGATTT CAGAGG CTCAGAGACTG CAAG CTG C TTG C C CAAGT CACACAG CTAG TGAAGAC CAGAG CAG 18 01 

TTTCATCTGGTTGTGACTCTAAGCTC^GTGCTCTCTCC^CTACCCCACACCAGCCTTGGTGCCACC^W^AGTGCTCCCC 18 8 0 

AAAAGGAAGGAGAATGGG AT TTT T C TTTTG AGG CATG CACAT C TGGAAT T AAGG T CAAAC TAATT CT CACAT C C CT CTA 1959 

AAAGTAAACTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGC^GCCGTCCTTCTAATGAAGACAATGATATTGAC 2 03 8 

ACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCATACAGGTTAACCTGCAGAAA 2117 

CAGTACT TAGGTAATTGTAGGG CGAGG ATTATAAATGAAATT TG CAAAAT CACTT AG CAG CAACTGAAGACAAT TAT CA 2196 

AC CACG TGGAG AAAATCAAAC CGAG CAGGG C TG TGTGAAACATGG TTGTAATATG CGAC TG CGAACACTGAAC T CTACG 22 75 

CCACTCCACAAATGATGTTTTCAGGTGTCATGGACTGTTGCCAC CATG TAT T CAT CCAGAGTTCTTAAAG TTT AAAGTT 2 3 54 

G CACATGATTG TATAAG CATG CT TT CT TTGAGT T TTAAATTATG TAT AAACATAAG T TG CAT T TAGAAAT CAAG CATAA 24 33 

AT CACTT CAACTGCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 24 7 9 
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GAATT CGG CACGAG AGACGACGTG CTGAG C TGC CAG C TTAG TGGAAG CT CTG CT CTGGGTGG AGAG GAG C C T CG CTT TG 7 9 

MVAAVLLG 8 

GTGACG CACAGTG C TGGGAC C CT C CAGGAG C CC CGGG ATTGAAGG ATG GTG GCG GCC GTC CTG CTG GGG 14 8- 

LSWLCSPLGALVLDFNNIRS 28 

CTG AGC TGG CTC TGC TCT CCC CTG GGA GCT CTG GTC CTG GAC TTC AAC AAC ATC AGG AGC 208 

SADLHGARK GSQCLSDTDCN 48 

TCT GCT GAC CTG CAT GGG GCC CGG AAG GGC TCA CAG TGC CTG TCT GAC ACG GAC TGC AAT 268 

T R K F C L Q P • R D E K P F CAT C R G 68 

ACC AGA AAG TTC TGC CTC CAG CCC CGC GAT GAG AAG CCG TTC TGT GCT ACA TGT CGT GGG 32 8 

LRRRCQRDAMCCPGTLCVND 88 

TTG CGG AGG AGG TGC CAG CGA GAT GCC ATG TGC TGC CCT GGG ACA CTC TGT GTG AAC GAT 3 88 

VCTTMEDATP I LERQLDEQD 108 

GTT TGT ACT ACG ATG GAA GAT GCA ACC CCA ATA TTA GAA AGG CAG CTT GAT GAG CAA GAT 44 8 

GTHAEGTTGHPVQENQ PKRK 128 

GGC ACA CAT GCA GAA GGA ACA ACT GGG CAC CCA GTC CAG GAA AAC CAA CCC AAA AGG AAG 50 8 

PSIKKSQGRKGQEGESCLRT 148 

CCA AGT ATT AAG AAA TCA CAA GGC AGG AAG GGA CAA GAG GGA GAA AGT TGT CTG AGA ACT 56 8 

FDCGPGL 'CCARHFWTK I CKP 168 

TTT GAC TGT GGC CCT GGA CTT TGC TGT GCT CGT CAT TTT TGG ACG AAA ATT TGT AAG CCA. 628 

VLLEGQVCS RRGHKDTAQAP 188 

GTC CTT TTG GAG GGA CAG GTC TGC TCC AGA AGA GGG CAT AAA GAC ACT GCT CAA GCT CCA 688 

EIFQRCDCGPGLLCRSQLTS 208 

GAA ATC TTC CAG CGT TGC GAC TGT GGC CCT GGA CTA CTG TGT CGA AGC CAA TTG ACC AGC 74 8 

NRQHARLRVCQKIEK'L* 225 

AAT CGG CAG CAT GCT CGA TTA AGA GTA TGC CAA AAA ATA GAA AAG CTA TAA 799 



AT ATT T CAAAATAAAGAAGAATC CACATTG CAAAAAAAAAAAAAAAAAA 



848 




Figure 3 



GTCGACCCACGCGTCCGCGGACGCGTGGGCGGCACGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTC 7 9 

MM.ALGAAGATRV 12 

TTTCTTTCTCCCTCTTGAGTCCTTCTGAG ATG ATG GCT CTG GGC GCA GCG GGA GCT ACC CGG GTC 144 

FVAMVAAALGGHPLLGVSAT 32 

TTT GTC GCG ATG GTA GCG GCG GCT CTC GGC GGC CAC CCT CTG CTG GGA GTG AGC GCC ACC 204 

LNSVLNSNAIKNLP PPLGGA 52 

TTG AAC TCG GTT CTC AAT TCC AAC GCT ATC AAG AAC CTG CCC CCA CCG CTG GGC GGC GCT 2 64 

AG HPGSAVSAAPGI LYPGGN 72 

GCG GGG CAC CCA GGC TCT GCA GTC AGC GCC GCG CCG GGA ATC CTG TAC CCG GGC GGG AAT 324 

KYQT I DNYQPYPCAEDEECG 92 

AAG TAC CAG ACC ATT GAC AAC TAC CAG CCG TAC CCG TGC GCA GAG GAC GAG GAG TGC GGC 384 

TDEYCASPTRGGDAGVQICL 112 

ACT GAT GAG TAC TGC GCT AGT CCC ACC CGC GGA GGG GAC GCA GGC GTG CAA ATC TGT CTC 444 

ACRKRRKRCMRHAMCCPGNY 132 

GCC TGC AGG AAG CGC CGA AAA CGC TGC ATG CGT CAC GCT ATG TGC TGC CCC GGG AAT TAC 504 

CKNGICVS S DQNHFRGEIEE 152 

TGC AAA AAT' GGA ATA TGT GTG TCT TCT GAT CAA AAT CAT TTC CGA GGA GAA ATT GAG GAA 564 

TITESFGNDHSTL DGYSRRT 172 

ACC ATC ACT GAA AGC TTT GGT AAT GAT CAT AGC ACC TTG GAT GGG TAT TCC AGA AGA ACC 62 4 

TLSSKMYHTKGQEGSVCLRS 192 

ACC TTG TCT TCA AAA ATG TAT CAC ACC AAA GGA CAA GAA GGT TCT GTT TGT CTC CGG TCA 684 

SDCASGLCCARHFWSKICKP 212 

TCA GAC TGT GCC TCA GGA TTG TGT TGT GCT AGA CAC TTC TGG TCC AAG ATC TGT AAA CCT 74 4 

VLKEGQVCTKHRRKG SHGLE 232 

GTC CTG AAA GAA GGT CAA GTG TGT ACC AAG CAT AGG AGA AAA GGC TCT CAT GGA CTA GAA 804 

IFQRCYCGEGLSCRIQKDHH 252 

ATA TTC CAG CGT TGT TAC TGT GGA GAA GGT CTG TCT TGC CGG ATA CAG AAA GAT CAC CAT 8 64 

QASNSSRLHTCQRH* 267 
CAA GCC AGT AAT TCT TCT AGG CTT CAC ACT TGT CAG AGA CAC TAA 909 

ACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACCTTTTATGACCTTCATCAACTCAATC 98 8 

CTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCATTCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTTTGT 1067 

TTCTTTATGGAACTCCCCTGTGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAATGCAA 114 6 

TGAAACTTTTAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTTTTGTACACATTGAT 1225 

TGTT ATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATTTCAGCTT ATAGTTCTTAAAAGCATAACCCTT 1304 

TACCCCATTTAATTCTAGAGTCTAGAACGCAAGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAA 1383 

AATACTAGCTTATTTTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGAAAT 14 62 
AAAATTTAACATTTAATATCATGAAATGTTATAAGTAGACATAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGC 1536 
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GTCGACCCACGCGTCCGGCGGGAGCCCGCGGCGAGCGTAGCGCAAGTCCGCTCCCTAGGCATCGCTGCGCTGGCAGCGA 7 9 

TTCGCTGTCTCTTGTGAGTCAGGGGACAACGCTTCGGGGCAACTGTGAGTGCGCGTGTGGGGGACCTCGATTCTCTTCA 158 

GATCTCGAGGATTCGGTCCGGGGACGTCTCCTGATCCCCTACTAAAGCGCCTGCTAACTTTGAAAAGGAGCACTGTGTC 237 

CTGCAAAGTTTGACACATAAAGGATAGGAAAAGAGAGGAGAGAAAAGCAACTGAGTTGAAGGAGAAGGAGCTGATGCGG 316 

GCCTCCTGATCAATTAAGAGGAGAGTTAAACCGCCGAGATCCCGGCGGGACCAAGGAGGTGCGGGGCAAGAAGGAACGG 395 

AAGCGGTGCGATCCACAGGGCTGGGTTTTCTTGCACCTTGGGTCACGCCTCCTTGGCGAGAAAGCGCCTCGCATTTGAT 4 7 4 

TGCTTCCAGTTATTGCAGAACTTCCTGTCCTGGTGGAGAAGCGGGTCTCGCTTGGGTTCCGCTAATTTCTGTCCTGAGG 553 

CGTGAGACTGAGTTCATAGGGTCCTGGGTCCCCGAACCAGGAAGGGTTGAGGGAACACAATCTGCAAGCCCCCGCGACC 632 

CAAGTGAGGGGCCCCGTGTTGGGGTCCTCCCTCCCTTTGCATTCCCACCCCTCCGGGCTTTGCGTCTTCCTGGGGACCC 711 

MAALMRSKDSSCCLLLL17 
CCTCGCCGGGAG ATG GCC GCG TTG ATG CGG AGC AAG GAT TCG TCC TGC TGC CTG CTC CTA CTG 77 4 

AAVLMVE S SQIGSSRAKLNS 37 

GCC GCG GTG CTG ATG GTG GAG AGC TCA CAG ATC GGC AGT TCG CGG GCC AAA CTC AAC TCC 834 

IKSSLGGETPGQAANRSAGM 57 

ATC AAG TCC TCT CTG GGC GGG GAG ACG CCT GGT CAG GCC GCC AAT CGA TCT GCG GGC ATG 8 94 

YQGLAFGGSKKGKNLGQAYP 77 

TAC CAA GGA CTG GCA TTC GGC GGC AGT AAG AAG GGC AAA AAC CTG GGG CAG GCC TAG CCT 954 

CSSDKECEVGRYCHSPHQGS 97 

TGT AGC AGT GAT AAG GAG TGT GAA GTT GGG AGG TAT TGC CAC AGT CCC CAC CAA GGA TCA 1014 

SACMVCRRKKKRCHRDGMCC 117 

TCG GCC TGC ATG GTG TGT CGG AGA AAA AAG AAG CGC TGC CAC CGA GAT GGC ATG TGC TGC 107 4 

PSTRCNNGICIPVTESILTP 137 

CCC AGT ACC CGC TGC AAT AAT GGC ATC TGT ATC CCA GTT ACT GAA AGC ATC TTA ACC CCT 1134 

HI PALDGTRHRDRNHGHYSN 157 

CAC ATC CCG GCT CTG GAT GGT ACT CGG CAC AGA GAT CGA AAC CAC GGT CAT TAC TCA AAC 1194 

HDLGWQN LGRPHTKMSHIKG 177 

CAT GAC TTG GGA TGG CAG AAT CTA GGA AGA CCA CAC ACT AAG ATG TCA CAT ATA AAA GGG 1254 

HEGDPCLRSSDCIEGFCCAR 197 

CAT GAA GGA GAC CCC TGC CTA CGA TCA TCA GAC TGC ATT GAA GGG TTT TGC TGT GCT CGT 1314 

HFWTKI CKPVLHQGEVCTKQ 217 

CAT TTC TGG ACC AAA ATC TGC AAA CCA GTG CTC CAT CAG GGG GAA GTC TGT ACC AAA CAA 137 4 

RKKG S HG LE I FQRCDCAKGL 237 

CGC AAG AAG GGT TCT CAT GGG CTG GAA ATT TTC CAG CGT TGC GAC TGT GCG AAG GGC CTG 14 34 

SCKVWKDATYSSKARLHVCQ 257 

TCT TGC AAA GTA TGG AAA GAT GCC ACC TAC TCC TCC AAA GCC AGA CTC CAT GTG TGT CAG 14 94 

K I * 260 
AAA ATT TGA 1503 

T CAC CAT T G AGG AAC AT CAT C AAT T G CAG AC T G T G AAG T T GT G T AT T T AAT G CAT TAT AG C AT G G T GG AAAAT AAG GT T 1582 

CAGATGCAGAAGAATGGCTAAAATAAGAAACGTGATAAGAATATAGATGATCACAAAAAGGGAGAAAGAAAACATGAAC 1661 




TGAATAGATTAGAATGGGTGACAAATGCAGTGCAGCCAGTGTTTCCATT ATGCAACTTGTCTATGTAAATAATGTACAC 17 4 0 




Figure 4B 

AT T T GT G G AAAAT G C T AT T AT T AAG AG AAC AAG C AC AC AG T GG AAAT T ACT GAT G AGT AGC AT GT G AC T T T C C AAG AGT 1819 
TTAGGTTGTGCTGGAGGAGAGGTTTCCTTCAGATTGCTGATTGCTTATACAAATAACCTACATGCCAGATTTCTATTCA 18 98 
ACGTT AG AGTT T AAC AAAAT ACT C CT AG AATAACTTGT TAT AC AATAGGTTCTAAAAAT AAAAT TGCTAAAC AAG AAAT 1977 
GAAAACATGGAGCATTGTTAATTTACAACAGAAAATTACCTTTTGATTTGTAACACTACTTCTGCTGTTCAATCAAGAG 2056 
TCTTGGTAGATAAGAAAAAAATCAGTCAATATTTCCAAATAATTGCAAAATAATGGCCAGTTGTTTAGGAAGGCCTTTA 2135 
GGAAGACAAATAAATAACAAACAAACAGCCACAAATACTTTTTTTTCAAAATTTTAGTTTTACCTGTAATTAATAAGAA 2214 
CTGATACAAGACAAAAACAGTTCCTTCAGATTCTACGGAATGACAGTATATCTCTCTTTATCCTATGTGATTCCTGCTC 22 93 
T G AAT G CAT T AT AT T T T C C AAAGT AT AC C CAT AAAT T GT G AC T AGT AAAAT ACT T AC AC AG AG C AG AAT T T T C AC AG AT 2372 
GG C AAAAAAAT T T AAAG AT G T C C AAT AT AT G T G G G AAAAG AG C T AAC AG AG AG AT CAT TAT T T C T T AAAG AT T G G C CAT 24 51 
AAC C T G TAT T T T GAT AG AAT TAG AT T G G T AAAT AC AT G T AT T CAT AC AT AC T C T G T G GT AAT AG AG AC T T GAG C T G GAT 2 530 
CTGTACTGCACTGGAGTAAGCAAGAAAATTGGGAAAACTTTTTCGTTTGTTCAGGTTTTGGCAACACATAGATCATATG 2 60 9' 
TCTGAGGCACAAGTTGGCTGTTCATCTTTGAAACCAGGGGATGCACAGTCTAAATGAATATCTGCATGGGATTTGCTAT- 2 688 
CATAATNTTTCCTATGCNGNTGAATTCNGTGTGAGGTCCTGTGTCCGTCCTATCCTCAAATTATTTATTTTATAGTGCT 27 67 
GAGATCCTCAAATAATCTCAATTTCGGAGGTTTCACAATVATGGACTCCTGAAGTAGACAGAGTAGTGAGGTTTCATTGC 28 4 6 
CCTCTATAAGCTTCTGACTAGCCAATGGCATCATCCAATTTTCTTCCCAAACCTCTGCAGCATCTGCTTTATTGCCAAA 2 925 
GGGCTAGTTTCGGTTTTCTGCCAGCCATTGCGGTTAAAAAATATAAGTAGGATAACTTGTAAAACCTGCATATTGCTAA 3004 
TCTATAGACACCACAGTTTCTAAATTCTTTGAAACCACTTTACTACTTTTTTTAAACTTAACTCAGTTCTAAATACTTT 3083 
GTCTGGAGCACAAAACAATAAAAGGTTATCTTATAGTTGTGACTTTAAACTTTTGTAGACCACAATTCACTTTTTAGTT 3 1 62 
TTCTTTTACTTAAATCCCATCTGCAGTCTCAAATTTAAGTTCTCCCAGTAGAGATTGAGTTTGAGCCTGTATATCTATT 32 41 
AAAAAT T T C AAC T T C C C AC AT AT AT T T AC T AAG AT G AT T AAG AC T T AC AT T T T C T G C AC AGG T C T G C AAAAAC AAAAAT 3320 
TATAAACTAGTCCATCCAAGAACCAAAGTTTGTATAAACAGGTTGCTATAAGCTTGGTGAAATGAAAATGGAACATTTC 3399 
AATCAAACATTTCCTATATAACAATTATTATATTTACAATTTGGTTTCTGCAATATTTTTCTTATGTCCACCCTTTTAA 34 7 8 
AAAT T AT T AT T T G AAG T AAT T TAT T T AC AG G AAAT G T T AAT GAG AT GT AT T T T C T T AT AG AG AT AT T T C T T AC AG AAAG 3557 
C T T T G T AGC AG AAT AT AT T T G C AG C T AT T G AC T T T G T AAT T T AGG AAAAAT G TAT AAT AAG AT AAAAT C TAT T AAAT T T 3 63 6 
TTCTCCTCTAAAAACTGAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGC 3 687 
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FGTCGACCCACGCGTCCGCTGTGGCAGCCCAGCTACCGGTCGTGACCAGATCCAGCTTGCAGCTCAGCTTTGTTCATTC 7 9 

MQRLGG ILLCTL 12 

GAATTGGGCGGCGGCCAGCGCGGAACAAAC- ATG GAG CGG CTC GGG GGT ATT TTG CTG TGT ACA CTG 145 

LAAAV P TA PA P S P T VTW T PA 32 

CTG GCG GCG GCG GTC CCC ACT GCT CCT GCT CCT TCC CCG ACG GTC ACT TGG ACT CCG GCG 205 



EPGPALNYPQEEATLNEMFR 52 

GAG CCG GGC CCA GCT CTC AAC TAC CCT CAG .GAG GAA GCT ACG CTC AAT GAG ATG TTT CGA 265 

EVEELMEDTQHKLRSAVEEM 72 

GAG GTG GAG GAG CTG ATG GAA GAC ACT CAG CAC AAA CTG CGC AGT GCC GTG GAG GAG ATG 325 



E A E E A A A 

GAG GCG GAA GAA GCA GCT GCT 

N Y H N E T S 

AAC TAT CAC AAT GAG ACC AGC 



K T S S E V N 

AAA ACG TCC TCT GAG GTG AAC 

T E T R V G N 

ACG GAG ACC AGG GTG GGA AAT 



L A S L P P 92 

CTG GCA AGC TTA CCT CCC 3 85 

N T V H ' V H 112 

AAC ACA GTC CAT GTG CAC 44 5 



in QEVHKITNNQ 

*!? CAG GAA GTT CAC AAG ATA ACC AAC AAC CAG 

[U ITSVGDEEGK 

CP ATT ACA TCT GTA GGG GAT GAA GAA GGC AAG 

Q CGPTRYCQFS 

fjj TGT GGG CCC -ACC AGG TAC TGC CAG TTC TCC 

iyi DQQMLCTRDS 

£ _ GAC CAG CAG ATG CTA TGC ACC CGA GAC AGT 

\d GHC TQKATKG 

f== GGT CAC TGC ACC CAA AAG GCC ACC AAA GGT 

^L! DCQPGLCCAF 

^jf GAT TGC CAG CCT GGC CTG TGT TGT GCC TTC 

PLPVEGELCH 
CCC CTG CCC GTG GAG GGA GAG CTC TGC CAT 



SGQVVFSETV 132 

AGT GGA CAG GTG GTC TTT TCT GAG ACA GTC 505 

RSHECIIDED 152 

AGG AGC CAT GAA TGT ATC ATT GAT GAA GAC 565 

SFKYTCQPCR 172 

AGC TTC AAG TAC ACC TGC CAG CCA TGC CGG 62 5 

ECCGDQLCAW 192 

GAG TGC TGT GGA GAC CAG CTG TGT GCC TGG 685 

GNGTICD-NQR 212 

GGC AAT GGG ACC ATC TGT GAC AAC CAG AGG 74 5 

QRGLLFPVCT 232 

CAA AGA GGC CTG CTG TTC CCC GTG TGC ACA 805 

DPTS QLLDLI 252 

GAC CCC ACC AGC CAG CTG CTG GAT CTC ATC 86 5 



TWELEPEGALDRCPCASGLL 272 

ACC TGG GAA CTG GAG CCT GAA GGA GCT TTG GAC CGA TGC CCC TGC GCC AGT GGC CTC CTA 925 

CQPHSHS LVYMCKPAFVGSH 292 

TGC CAG CCA CAC AGC CAC AGT CTG GTG TAC ATG TGC AAG CCA GCC TTC GTG GGC AGC CAT 985 



DHSEESQLPRE 
GAC CAC AGT GAG GAG AGC CAG CTG CCC AGG GAG 

FIGEVRQELED 
TTC ATA GGG GAA GTG CGC CAG GAG CTG GAA GAC 



APDEYEDVG 312 

GCC CCG GAT GAG TAC GAA GAT GTT GGC 104 5 

LERSLAQEM 332 

CTG GAG CGG AGC CTA GCC CAG GAG ATG 1105 
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AFEGPAPVESLGGEEE I * 
GCA TTT GAG GGG CCT GCC CCT GTG GAG TCA CTA GGC GGA GAG GAG GAG ATT TAG 
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GCC CAGAC CCAG CTGAGTCACTGGTAGATG TG CAATAGAAATGG CTAATTTAT TTT C C CAGGAGTGT CC C CAAG TGTGG 1238 

AATGGCCGCAGCTCCTTCCCAGTAGCTTTTCCTCTGGCTTGACAAGGTACAGTGCAGTACATTTCTTCCAGCCGCCCTG 1317 

CTTCTCTGACTTGGGAAAGACAGGCATGGCGGGTAAGGGCAGCGGTGAGTCGTCCCTCGCTGTTGCTAGAAACGCTGTC 13 96 

TTGTTCTTCATGGATGGAAGATTTGTTTGAAGGGAGAGGATGGGAAGGGGTGAAGTCTGCTCATGATGGATTTGGGGGA 14 75 

TACAGGGAGGAGGATGCCTGCCTTGCAGACGTGGACTTGGCAAAATGTAACCTTTGCTTTTGTCTTGCGCCGCTCCCAT 15 54 

GGGCTGAGGCAGTGGCTACACAAGAGCTATGCTGCTCTGTGGCCTCCCACATATTCATCCCTGTGTTTCAGCTCCTACC 1633 

TCACTGTCAGCACAGCCCTTCATAGCCACGCCCCCTCTTGCTCACCACAGCCTAGGAGGGGACCAGAGGGGACTTCTCT 1712 

CAGAGC C C CATG CTC T CT C T CT CAAC C C CATAC CAG C CT C TG TG C CAG CGACAGT CCTT C CAAATGGAGGGAGTGAAAT 17 91 

CCTTTGGTTTAATTATTTTCTCCTTCAAGGCACGCCTGCCACTAAGGTCAGGCTGACTTGCATGTCCCTCTAACGTTCG 187 0 

TAGCAGTGTGGTGGACACTGTCTTCCACCGACTGCTTCAATACCTCTGAAAGCCAGTGCTCGGAGTGCAGTTCGTGTAA 194 9. 

ATTAAT TTG CAGG AAGTATACTTGGCTAATTG T AGGG C TAGGATTG TG AATG AAAT T TG CAAAG T CG CTTAGCAACAAT 2 02 8 

GGAAAG CC T TT CT CAGT CACAC CGAGAAGT CACAAC CAAG C CAGGT TGTGTAG AGTACAG CTGTG ACATACAGACAGAA 2107 

GAAGG C TGGG CTGGATG T CAGG CCT CAG ATG ACGGT T T CAGGTG C CAGGAAC TAT TAC CATT C TG TAT CTAT C CAGAGT 2186 

TATTAAAATTGAAAGTTGCACACATTTGTATAAGCATGCCTTTCTCCTGAGTT 2 2 6 5 

TGGCCCTCAAAGATGATGCACAAACGACTACTCTTTGCTAATTCTTGGACTTTTCTCTTTGATTTTCA 2 3 44 

TC C C CT T CATG CAAAAAAAAAAAAAAAGGG CGG CCG C 2 3 81 
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61 _ _ 120 

hdkk-l GAAGATRVFV AMVAAALGGH PLL . . . G . . V SATLNSVL . . NSNAIKNLP. PPLGGAAGHP 

mdkk-1 CAPAAVRFLA VFTMMALCSL Pf.T G..A SA TLNSVr.T . NSNAIKNLP. PPLGGAGGQP 

xdkk-1 — MGSNMFPV PLIVFWG FIL DGALG FVMMT NSNSIKNVPA APAGQPIGY. 

hdkk-2 MA ALMRSKDSSC CLLLLAAVLM . . . VESSOTfi SRRAt fT.MSTTf SSLGGET . . P 

hdkk-3 EVEELMEDTQ HKLRSAVEEM E . AEEAAAKA SSEVNLANLP PSYHNETNTD TKVGNNT I H V 

mdkk-3 EVEELMEDTQ HKLRSAVEEM E . AEEAAAKT SSEVNLASLP PNYHNETSTE TRVGNNTVHV 

Cdkk-3 EVEALMEDTQ HKLRNAVQEM E . AEEEGAKK LSEVNFENLP PT YHNE SNTE TRIGNKTVQT 

hdkk-4 MVAAVLLGr, 
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SWLC SP 



APGILYPG. . 
APGVLYEG . . 
SPDSLYDI . . 
SAG . MYQGLA 
QTGQMVFSET 
QSGQWFSET 
RTGSTIFSET 
LGA LVLDFNN 



. GNKYQTIDN 
. GNKYQTLDN 
. ANKYQPLDA 
FGGSKKGKNL 
VITSVGDEEG 
VITSVGDEEG 
IITSIKGGEN 
IRSSADLHGA 



RRSHECI 
KRSHECI 
KRNHECI 



YQPYPCAEDE 
YQPYPCAEDE 
YPLYSCTEDD 
GQAYPCSSDK 
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RKGSqCLSDT 



ECGTDEYCAS 
ECGSDEYCSS 
DCALDEFCHS 
ECEVGRYCHS 
DCGPSMYC . . 
DCGPTRYC. . 
DCETGKYC . . 
DCNTRKFCLQ 
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PTRG . . GDAG 
PSRGAAGVGG 
SRNGNS .... 
PHQGSSA. . . 

QFASF 

QFSSF 

QFSTF 

PRDEKP . . . . 
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| VQICLACRKR 
[ VQICLACRKR 
. LVCLACRKR 
1 . . . CMVCRRK 
QYTCQPCRGQ 
KYTCQPCRDQ 
EYKCQPCKTQ 
. . FCATCRGL 



RKRCMRHAMC 
RKRCMTHAMC 
RKRCLRDAMC 
KKRCHRDGMC 
RMLCTRDSEC 
QMLCTRDSEC 
HTHCSRDVEC 
RRRCQRDAMC 



C PGNYCKNG I 
CPGNYCKNGI 
CTGNYCSNGI 
CPSTRCNNGI 
CGDQLCVWGH 
CGDQLCAWGH 
CGDQLCVWGE 
CPGTLCVNDV 
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C^P..SDHSH FP.RGEIEES 
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ITESFGN.DH 
IIENLGN.DH 
I LENYNNADH 



CVPVEQDQER FQHQGYLEET 
CIPV.TESIL TPHIPALDGT RHRD . RNHGH 

CTKMAT 

CTQKAT 

CRKATS 

CJITME - DATP ILERQLDEQD GTHAEGTTGH 



hdkk-l 
mdkk-1 
xdkk-1 
hdkk-2 
hdkk-3 
mdkk-3 
cdkk-3 
hdkk-4 



hdkk-l 
mdkk-1 
xdkk-1 
hdkk-2 
hdkk-3 
mdkk-3 
cdkk-3 
hdkk-4 



hdkk-l 
-ndkk-1 
xdkk-1 
hdkk-2 
hdkk-3 
mdkk-3 
cdkk-3 
hdkk-4 



241 

STL . . DGYSR RTTLSSKMYH 

NAAAGDGYPR RTTLT*§KjlYH 

ATM . . DTHSK LTTSPSGMQP 

YSNHDLGWQN LGRPHT^MSH 



PV. . QENQPK RKPSIKKlSQG 
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ILLPREVPDE YEVGSFMEEV RQELEDLERS LTEEMALGEP AAAAAALLGGEE I - 

SQLPREAPDE YEDVGFIGEV RQELEDLERS LAQEMAFEGP APVES . . LGGEEEI 

PLNMDEMPFI SLIPRDILSD YEESSVIQEV RKELESLE . . . DQAGVKSEH DPAHDLFLGDEI — 



Figure 7 



M 1 

CTCGAGG C CAAAATT CGG CACGAGG C CGGG CTGTGGT C TAG CATAAAGG CGGAG C CCAGAAGAAGGGG CGGGGT ATG 7 7 

GEASPPAPARRHL LVLLLLL 21 

GGA GAA GCC TCC CCA CCT GCC CCC GCA AGG CGG CAT CTG CTG GTC CTG CTG CTG CTC CTC 13 7 

S TLV I P S AAA P I H DADAQ E S 41 

TCT ACC CTG GTG ATC CCC TCC GCT GCA GCT CCT ATC CAT GAT GCT GAC GCC CAA GAG AGC 197 

SLGLTGLQSLLQGFSRLF-LK 61 

TCC TTG GGT CTC ACA GGC CTC GAG AGC CTA CTC CAA GGC TTC AGC CGA CTT TTC CTG AAA 257 

GNLLRG IDSLFSAPMDFRGL 81 

GGT AAC CTG CTT CGG GGC ATA GAC AGC TTA TTC TCT GCC CCC ATG GAC TTC CGG GGC CTC 317 

PGNYHKEENQEHQLGNNTLS 101 

CCT GGG AAC TAC CAC AAA GAG GAG AAC CAG GAG CAC- CAG CTG GGG AAC AAC ACC CTC TCC 3 77 

SHLQIDKMTDNKTGEVLISE 121 

AGC CAC CTC CAG ATC GAC AAG ATG ACC GAC AAC AAG ACA GGA GAG GTG CTG ATC TCC GAG 43 7 

NVVAS IQPAEGSFEGDLKVP 141 

AAT GTG GTG GCA TCC ATT CAA CCA GCG GAG GGG AGC TTC GAG GGT GAT TTG AAG GTA CCC 4 97 

RMEEKEALVPIQKATDSFHT 161 

AGG ATG GAG GAG AAG GAG GCC CTG GTA CCC ATC CAG AAG GCC ACG GAC AGC TTC CAC ACA 557 

ELHPRVAFWI I KLPRRR SHQ 181 

GAA CTC CAT CCC CGG GTG GCC TTC TGG ATC ATT AAG CTG CCA CGG CGG AGG TCC CAC CAG 617 

DALEGGHWLSEKRHRLQAIR 201 

GAT GCC CTG GAG GGC GGC CAC TGG CTC AGC GAG AAG CGA CAC CGC CTG CAG GCC ATC CGG 6 77 

DGLRKGTHKDVLEEGTESSS 221 

GAT GGA CTC CGC AAG GGG ACC CAC AAG GAC GTC CTA GAA GAG GGG ACC GAG AGC TCC TCC 73 7 

HSRLSPRKTHLLYILRPSRQ 241 

CAC TCC AGG CTG TCC CCC CGA AAG ACC CAC TTA CTG TAC ATC CTC AGG CCC TCT CGG CAG 797 

L * 243 

CTG TAG 803 

GGGTGGGGACCGGGGAGCACCTGCCTGTAGCCCCCATCAGACCCTGCCCCAAGCACCATATGGAAATAAAGTTCTTTCT 882 

TACATCTAAAAAAAAAAAAAAAAAAAAAAAAAAAATTGGCGGCCGC 92 8 
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Figure 8 



M C R L R 5 

G AAT T C G G C AC G AGG CAG AAG G C G C G AAT G AAG G C AAAG C C T C C C AC CCACCTGCA ATG TGT CGA CTG AGG 71 

VLLLLLPLAF V SSSA LPIHD 25 

GTC TTG CTG CTG CTG CTC CCC TTG GCC TTC GTG TCC TCC TCT GCT CTC CCC ATC CAT GAT 131 

VDSQQNTSG FLGLQRLLQSF 45 

GTC GAC TCT CAG CAG AAC ACC TCC GGG TTC CTG GGC CTT CAG AGG CTT CTC CAA AGC TTT 161 

SRLFLKNDLLRDLDNFFSSP 65 

AGT CGA CTG TTC CTA AAA AAT GAC CTG CTA CGA GAC CTG GAC AAC TTC TTC TCC TCC CCC 251 

MDFRDLPRN FHQEENQEHRM 85 

ATG GAC TTC CGA GAC CTT CCT AGG AAC TTC CAT CAG GAA GAG AAC CAG GAG CAC AGA ATG 311 

GNHTLSSHLQI DKVT DNQTG 105 

GGC AAC CAT ACC CTC TCC AGC CAC CTA CAG ATA GAC AAG GTG ACT GAC AAC CAG ACA GGG 371 

EVHI SEKVEASIEPERNPEG 125 

GAG GTG CAC ATC TCG GAG AAA GTC GAG GCC TCC ATT GAG CCA GAA CGG AAC CCG GAA GGG 4 31 

DWKVPKVEAKEPPVPVQKVT 145 

GAC TGG AAG GTT CCC AAA GTA GAA GCA AAA GAG CCC CCG GTG CCT GTG CAG AAG GTC ACC 4 91 

DSLHPEPRQV AFWIMKMPRR. 165 

GAC AGC TTG CAC CCA GAG CCC CGG CAG GTG GCT TTC TGG ATC ATG AAG ATG CCA AGG CGG 551 

RTQPDVQDGGRWLI EKRHRM 185 

AGG ACC CAG CCC GAT GTC CAG GAT GGA GGC CGC TGG CTC ATA GAA AAG CGA CAT CGC ATG 611 

QAIRDGLRGGAREDSLEDGV 205 

CAG GCC ATC CGG GAT GGG CTC CGT GGA GGC GCC CGT GAG GAC AGC CTG GAG GAT GGG GTC 671 

HIPQHAKLPVRKTHFLYILR 225 

CAT ATC CCC GAA CAC GCC AAG CTG CCT GTC AGA AAG ACA CAC TTT CTC TAC ATC CTC AGG 7 31 

P S Q Q L * 231 

CCA TCC CAA CAG CTG TAA GTGGGGACCAGATGTCCCACACCCTACCCCAACACCATATGGAAATAAAGGTTTTC 805 

TTACATCTAAAAAAAAAAAAAAAAAAAAAA 835 
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Figure 10 
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hsoggy MGEASPP APA RRHL . LVLLL LLSTLVIPSA AA PTHDADAO ESSLG LTGLQSL 

msoggy MCRL . RVLLL LLPLAFVSSS A LPIHDVDSO QNTSG FLGLQRL 

hdkk-3 MORLGATLLC LLLAAAVPTA PA PAPTATSA PVKPGPALSY PQEEATLNEM 

mdkk-3 MQRLGG I LLC TLLAAAVPTA PA PSPTVTWT PAEPGPALNY PQEEATLNEM 



61 

hsoggy LQGFSRLF . . LKGNLLRGID SL FSAPMDFRGL 

msoggy LQSFSRLF . . LKNDLLRDLDNF FSSPMDFRDL 

hdkk-3 FREVEELMED TQHKLRSAVE EMEAEEAAAK ASSEVNLANL 
mdkk-3 FREVEELMED TQHKLRSAVE EMEAEEAAAK TSSEVNLASL 



PGNYHKEENQ EHQLGNNTLS 

PRNFHQEENQ EHRMGNHTLS 

PPSYHNETNT DTK VGNNT I H 

PPNYHNETST ETRVGNNTVH 
* * * * 
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SHLQIDKMTD NKTGEVLISE NWASIQPAE GSFEGDLKVP RMEEKIeIaLVP IQKATDSFHT 

s _ h ^?? dkvtd nqtgevhise kveasiep.e ; rnpegdwkvp kvea^e^^p_vqkvtdslhp 

" V/dedcgp 

. . DEDCGP 



VHREIHKITN NQTGQMVFSE TVITSVGDEE' GR RSHE^Yl. 

VHQEVHKITN NQSGQWFSE TVITSVGDEE \ GK RSHEfcll 
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ELHPR . VAFW I IKLPRRRSH . . . . QDALEG GHWLS EKRHR 

E. . PRQVAFW IMKMPRRRTQ . . . . PDVQDG GRWL I EKRHR 

SMYCQFASFQ YTCQPCRGQR MLCTRDSECC GDQLCVWGHC 

TRYCQFSSFK YTCQPCRDQQ MLCTRDSECC GDQLCAWGHC 



LQAIRDGLRK . 
MQAIRDGLRG . 
TKMATRGSNG T 
TQKATKGGNG T 
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PTFVGSR DQDGEILLPR EVPDEYEVGS FMEEVRQELE DLERSLTEEM 
PAFVGSH DHSEESQLPR EAPDEYEDVG FIGEVRQELE DLERSLAQEM 



361 379 
hdkk-3 ALGEPAAAAA ALLGGEEI- 
mdkk-3 AFEGPAPVES . . LGGEEEI 



Figure 11 
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